
DMA GENERAL 
CX)RPORATION 

Southboro, 

Massachusetts 01772 

(617)485-9100 



PROCS^AM 
Assembler 

TAPES 
Binary: 091-000002 



Copyright (C) Data General Corp. 1^c9 033-000017-02 



CONTENTS 

1 The Assembly Language 

2 Format 

3 Integers 
't Syftbols 

5 Special Atoms 

6 Operators and Expressions 

7 Location Counter 

B Comments and Statements 

9 Symbol 'definition 

10 Label and Equivalence Statenents 

11 Storage "ord Statements 

12 Basic Pseudo-ops 

Radix, location, block, end of tape, end, t.?xt 

13 Symbol Table rseudo-ops 
)k Operating Procedure 

APPEND I CrS 
A- Characters 
3 Pseudo-ops 
C Symbol Table 
D Error :Mncmonics 
E Listing Format 
F Object Tape Format 



IIJTRODUCTIOM 

The i.'OVA assembly program allows the programmer to v.'rite a source 
program in a symbolic, mnemonic language using the English alphabet, numerals, 
and other common characters. The assembler takes the source program as 
input (or more precisely the ASCII codes for the characters that make up the 
source program) and assembles it Into an object program. The output of the 
assembler is a complete listing of the source program on some output device 
and a tape of the object program ready for loading into memory by a binary 
loader. 

1 THE ASSEf'SLY LANGUAGE 
The assembler recognizes the codes for all ASCII characters, but null, 
line feed, rubout and form feed are transparent. The first three can be 
used in any way in the input and the assembler responds to the input tape 
as though those characters were not there at all; hence blank frames on the 
tape have no effect, and mistakes can be overpunched with rubouts. A form 
feed is eauaiiy transparent with respect both to the assembly of the object 
program and to the characters that appear in the listing, but it can affect 
the format of the listing. Throughout this manual a reference to "any 
character' means any ASCII character except these four. Of the remaining 
characters many can be used only to refer to themselves, ie to supply their 
own codes to the assembler rather than being used as symbols to represent 
something else. 

Hence the set of characters in the symbolic language is a subset of 
the ASCII character set, and this subset is listed in Appendix A. Sasically 
the characters in the language are used as operators, as punctuation, as 
elements in numbers, and as elements in symbols that orovide instructions 



to the assembler. Integers and mmeria symbols are character strings used 
as numbers. An integer, which specifies its own value, is a string of 
numerals. A symbol is a string of letters, numerals and periods whose 
value is either predefined (an initial symbol) or is defined by the source 
program. Symbols may be numeric and/or operational. An operational symbol 
tells the assembler to do something, purely operational symbols are called 
pseudo-ops. A symbolic address is an example of a purely numeric symbol. 
P.any symbols, such as the instruction mnemonics, are numeric in that they 
represent numbers, but they are also operational as they provide information 
to the assembler concerning the characteristics of the statements in which 
they appear. 

Integers and symbols are the basic units or atoms of the lanquage. 
There are also several special atoms that do not satisfy the definitions 
for integers and symbols. A double quote combined with any character can 
be used just as thcuch it were an integer whose value is the code for the 
characters. The special atoms also include the characters @ and # that 
are used for indirect addressing and to inhibit loading in an arithmetic 
and logical instruction. These characters can be used only in certain 
statements, but they are completely transparent in relation to grammatical 
structure, the' assembler responds to one of these in terms of assembling an 
instruction, but ignores it completely when determining the structure of 
the statement in which it appears. 

Operators are characters that specify arithmetic and logical relations 
among numbers^ ie integers and numeric symbols. A string of integers and 
symbols combined by operators is rin expression . Some characters are used 
5S punctuation to begin and end expressions, statements and comments, and 
to specify how parts of the source program are to be interpreted. 



The language has a formal hierarchy. Certain characters can be combined 
Into Intergers and symbols; integers and numeric symbols can be combined 
with operators into expressions. Using punctuation, expressions and purely 
operational symbols can be combined into statements. These are the 
fundamental macroelements of the language; they provide Instructions to the 
assembler, they define all symbols that are not initial, and they specify 
both the values of the words in the object program and the memory locations 
that will eventually receive them. Storage word statements and certain 
pseudo-op statements can generate output words for the object orogram. 
(Storage word statements are of two types, data statements and instruction 
statements.) Label statements, equivalence statements and the remaining 
pseudo-op statements are used to define symbols, locate the object program 
and control the assembly. Accompanying the statements are comments which 
provide commentary on the source program. 

As mentioned above, the characters null, line feed and rubout are 
totally transparent: the assembler completely ignores them and responds 
to a character string containing them as though they simply were not there. 
A form feed Is recognized by the assembler but only for format purposes (§2); 
In any other respect it is just as transparent as a rubout. The characters @ 
and # are also transparent with respect to grammatical structure even 
though they have a very definite effect on the generation of the object 
program-, when one of these characters appears in a statement the assembler 
responds to its presence only after evaluating the entire statement as 
though the character were not there. Among the characters used for punctua- 
tion, comma, space and tab, which are used to separate expressions in a 
statement, are grammatically identical and can be used interchangeably. 
Moreover In a string of these characters, all after the first are redundant; 



after encountering one such character, the assembler ignores them until 
some other character appears. Fol levying a carriage return all commas, 
spaces, tabs and further carriage returns are ignored until some other 
character appears. 

At the micro-level the source program is a long string of characters, 
but at the macro-level it is a string of lines separated by carriage 
returns. A single Una may contain a comment, any number of label state- 
ments, but no more than one statement of another type. In other words a 
carriage return may be followed by a comment or any l<lnd of statement; 
a label statement may be followed by a comment or any kind of statement 
including another label statement; but any statement other than a label 
statement must be followed either by a carriage return, which starts 
a new line, or by a comment, which in turn Is terminated by a carriage 
return. 

2 FORMAT 
The UOVA assembly language is format free. However, the listing of 
a symbolic source program has a very definite format. A listing is the 
output produced when the characters that comprise the source program are 
printed. Theteletype is the usual output medium, but a listing can be 
obtained on the line printer, and the string of characters in the listing 
can be punched in paper tape. The format of a listing is its visual 
appearance vlth respect to horizontal and vertical spacing, ie the use of 
spaces, tabs (tab settings) and carriage returns. 

Some format is intrinsic to the language because these format characters 
are used as punctuation, and the source program is automatically formatted 
into lines by the requirement that sometime after any statement other than 
a label statement, a carriage return must aooear before another statement 



can be given. Furthermore these characters can be used expressly to format 
the listing: all redundant spaces, tabs and carriage returns are interpreted 
only vvith respect to the listing format. Eg a logically redundant carriage 
return produces a blank line in the listing (although line feeds are 
ignored in the input the assembler automatically follows every carriage 
return with a line feed in order to properly space paper in the listing 
device) . 

Within broad limits, the programmer is free to determine the format of 
the listing for his program. All of these lines are identical as far as the 
assembler is concerned, ie they differ only in format but are identical in 
grammatical structure. 

LABEL: ADD# 2,3,SZP. :SKIP IF SUM ZEP.O 
LAEEL:A00,2,3,SZR#;SKIP IF SU" ZERO 

LABEL: ADD 2 3 # SZR ;SKIF IF GU'^ ZERO 
LABEL:,,,, #ADD, ,2, 3 ?Zf^;SKIP IF SUM ZERO 

LABEL :AD#D,2J, SZF ;SKIP IF SUM ZERO 

A common practice is to divide each line into four columns by means of three 
tab settings, using the left column for labels, the second column for all 
other statements with the arguments of an instruction nnemonic starting at 
a second tab setting, and the right column for program comments. This is 
the format of the first example above. If the listing device does not have 
automatic tabbing {eg the ASR33) , the assembler simulates tabs by spacing to 
the nearest assembler defined tab position (always leaving at least one 

space). These positions ars every eight columns, ie columns C', 17, 25, 

Although the form feed character is completely transparent as far as the 
assembly is concerned, it does affect the listing format. The assembler puts 
a form feed (and hence starts a new Dace in the listing) before any line 



in which that character is encountered. If the device is not equipped for 
form feeding, the function is simulated by line feeds, sixty Jines per page. 
In producing a listing the assembier actually orints out nore than just 
the source program. In each line of the listing the assembler first prints 
one-letter mnemonics (flags) indicating errors that have been made by the 

rtv/^n f smmaio ^Kan t-Uia -^AAirAC t: r\f ¥)nc% \ r\f^t \ r\n f-Kafr l*f i 1 1 rr\nt ^l r\ t* hp rthleCt 

word (if any) generated by a statement in the line, then the contents of 
that location (or if no storage word is generated, the value of whatever 
statement does appear in the line), and finally the line of the source program 
as formatted by the programmer. If the first instruction statement given 
above is assembled to be stored in location 'i^\h, the line would appear in 
the listing as follows: 

Q^k\k 157014 LABEL: ADD# 2,3,SZR ;SKIP IF SUM ZERO 
Following the program the assembler lists the values of the symbols defined 
by the programmer. 

3 INTEGERS 

An integer is a number computed in any radix from two to ten. The 
assembler converts each integer into one iG-bit unsigned number. The 
decimal integers to 32767 yield the octal numbers 000000 to 077777, 
the decimal integers 327f'B to v'5535 convert to 100000 to 17777- Using 
twos comolement conventions the program may treat the former words as 
positive numbers, the latter as negative. (The programmer can also 
generate signed numbers by using integers with operators as discussed in %( .) 

An Integer is any string of numerals that is preceded and followed by 
an operator or punctuation character and is neither in a program comment 
nor in a text string unless enclosed by angle brackets (§12). E^g the four 
strings 

3 38 9'J 12345673 



are all integers. (In all examples such as the above it is to be assumed 
tiiat appropriate delimiting characters, such as commas, spaces or operators, 
precede and follow each example.) ?ut the three character strings 

31.27 66A A123 
are not: the first two are illegal and would be flagged as number errors (N) ; 
the third is actually a symbol. 

The assembler assumes that all integers are octal unless the programmer 
gives a radix pseudo-cp to specify otherwise (§12). An integer that contains 
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number error. An integer greater than or equal to 2 is also flagged and 



is evaluated module 2 

k 3YMfOLS 

A symbol may have cither or both of two properties: a numeric symbol 
represents a iS-bit number; an operational symbol conveys information to the 
assembler. Some symbols are already defined before the assembly starts; these 
are known as initial symbols and include the instruction mnemonics and 
pseudo-ops. Other symbols can be defined In the source program as labels 
(which represent addresses), as other purely numeric symbols, or v.s opera- 
tional symbols that function like the instruction mnemonics. Operational 
symbols can be used to tell the assembler to do something; numeric symbols 
can be used as numbers in expressions. A symbol with both properties can be 
used tc initiate an instruction statement. It is then used as a number in 
evaluating the statement as well as being used to tell the assembler how 
to evaluate it. The difference between a numeric symbol and an integer is 
thct an integer specifies its own value, whereas the value of a numeric 
symbol must be looked up during assembly. 

Any string that begins with a letter or period and is composed entirely 
of letters, numerals and periods is a symbol if it is preceded and followed 
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by an operator or punctuation character and is neither in a program comment 
nor in a text string unless enclosed by angle brackets (§12). A period that 
by itself obeys these conditions is a special single character symbol v;hose 
value, which is determined each time it is usd, is equal to the current 
contents of the location counter (§7). The character strings 

r, 7 ^'>^^ Mi7^ii«;,<7R G.I ABC 

are symbols (the fifth is the special location symbol), but the strings 

1.27 123 LA$B 
are not: the first two do not begin with a letter or period (the second is 
actually an integer), and the last contains an illegal character. Although 
the assembler would flag the last string for a bad character (B) , it would 
interpret the string as two separate symbols. But depending upon the type 
of statement in which the string occurs, this interpretation would usually 
lead to other errors as well. 

Although a symbol can have any number of characters, the assembler uses 
only the first five to differentiate among them; in other words, all symbols 
whose first five characters are the same are indistinguishable to the assembler. 
Hence 

DITMASK eiTMA.7 BiriAQPRXJSK 
arc treated as the same symbol and can be used interchangeably. Long symbols 
are often used for clarity, but caution must be taken to ensure that symbols 
that are meant to be different actually differ in the first five characters. 

The assembler will accept the codes for lower case letters as input, 
but in symbols it simply translates then into upper case. Hence all of these 
symbols as source program input 

A3C0 ABCd abed AbcD abCd 
are equivalent to ABCD, which is the only form that appears in the listing. 



5 SPECIAL ATOMS 

Atoms in the assembly language correspond to words In a natural 
language. They are the strings of characters that are combined using 
operators and punctuation into epxresslons and statements. Besides 
integers and symbols, there are a few special atoms that have some of their 
properties but which contain characters that cannot be used in Integers 
and symbols. 

The character pair 

"x 
where ^ is any character other than rubout, line feed, form feed or null, 
is interpreted by the assembler as an Integer whose value Is the 7-bit 
ASCII code for the character x, provided the pair otherwise satisfies 
the conditions given for an integer. Hence giving the string 

Is the same as giving the octal integer 73, which is converted Into the 
octal word 000073. The character x is recognlzod only to the extent of 
using its value as an integer, and the preceding double quote destroys 
whatever operational value it may ctherv/lse have, eg as punctuation or as 
a user defined symbol. 

The other two special atoms are the symbols @ and #: the former is 
used to place a 1 In the Indirect bit of a memory reference instruction or 
address word, and can appear only in 3 statement that generates an output 
word of these types, the latter is used to place a 1 In the no-load bit of 
an instruction statement of that tyoe. These atoms are comoletely transparent 
with respect to the overall structure of any statement in which either appears 
and with respect to the structure of any other atom in the statanent. 
The appearance of either § cr # any number of times in a given statement 
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is equivalent to Its appearance only once, and its effect is exactly the 
same no matter where it appears in the character string that makes up the 
statement. The assembler first evaluates the entire statement as though 
the special atom were not there at all, and then ORs a 1 into the appropriate 
bit of the 16-bit result as indicated by @ or #. Hence all of these 
character strings are Interpreted by the assembler as being the same integer: 

§'♦673 '<'5§73 '*^73<^ '♦@67@3 

and all of these are interpreted as the same symbol: 

#ADDZL AD#OZL ADOZL# A#DD#ZL 

6 OPERATORS AUO EXPRESSIONS 
Operators are characters that specify arithmetic and logical relations 
among integers and symbols; both types of relations can be intermixed in one 
exoression. An expression is any series of integers and numeric symbols 
separated by operators. The term "expression" always includes the case of 
an integer or a symbol standing alone. As with all integers and numeric 
symbols, an expression has a l6-bit value, which the assembler computes by 
performing the indicated logical and arithmetic operations from left to 

right. 

An operator specifies an operation to be performed on the operands at 
either side of it. The operand at the left is all of the expression at the 
left, ie that oart of the whole cixnression from the beginning to the preceding 
integer cr symbol, the operand at the right is the next integer or symbol. 
Logical operators work bitv-'ise on pairs of operands; arithmetic operators 
treat operands as numbers. Mote that operands are intrinsically neither 
arithmetic nor logical: they are simoly Ic-bit numbers that are treated in 
different ways. 



The assembler interprets the following six characters as operators to 

specify two logical and four arithmetic operations with no checl< for overflow. 

Operatov Operation Interpretation of Operands 

+ Addition '.'nsigned l£-bit integers 

Subtraction Unsigned l6-bit integers 

* Multiplication Signed twos complement integers; result 

. is low order word 

/ Division Signed twos complement integers, result 

is one word, unrounded 

& Logical AND iC-bit logical words 

I Logical 0!^ iG-bIt logical words 

The plus and minus sign are additive operators, the others are product 

operators. An additive operator may tal<e either one or two operands, but 

in the former case the operator must be at the left in order to be meaningful. 

Actually the assembler assumes a zero operand at the beginning of any 

expression that begins with an operator and at the end of any expression 

that ends with an operator, but this can cause difficulty only with product 

operators--! t has no effect on additive operators. Consequently 

+A 

being equivalent to 

a+A 

is alright; the operator in 

. , i\- 
is meaningless but not illegal, and the expression is equivalent to, A. Note 
that an integer that is used to produce a negative number must haye a 
magnitude less than or equal to 2'^: ea the expression 

•■100001 
is not evaluated correctly but is not flagged as an error s ince thpr-e is no 
overflow checl<. The expression -x where x is greater than 2^ Is evaluated 
as 2 -X, which results In a positive number less than 2 ''. In the example 
given, the evaluation is 077777- 
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Expressions are evaluated from left to right taking one operand at 
a time; in evaluating 

A+B/C 
the assembler adds A to B and then divides the sum by C. If two operators 
are contiguous, the assembler assumes a zero operand between them. For a 
string of additive operators this means that only the final one is signi- 
ficant: 

A+-+-B 

is interpreted as 

A+0-O+O-B 

which is equivalent to 

A-CJ 
But with product operators you lose; 

A*-B 

is interpreted as 

A*0-B 
wnich is simply -B. To multiply A by -B the programmer must either give 

-B*A 
or define some symbol C as equal to -3 and then use the expression 

A*C 

7 LOCATION COUriTER 
As the assembler translates the source program into an object program, 
it not only generates the object words, but also generates information as 
to where they will be stored; for this purpose the assembler keeps a loca- 
tion count, \ihenever a storage word is generated, it is assigned to the 
location addressed by the current contents of the location counter. 
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At the start of an assembly, the assembler initializes the counter to 
location 0. During assembly the contents of the counter can be altered in 
several ways: 

The source program can set the counter to any desired 15-bit address 
by means of a location statement (§12). 

Every time a storage word is generated in the object program, the 
counter is incremented by one. Hence unless something else changes 
the counter, words are assigned to consecutive memory locations. 
(The location following 77777 Is 00000.) 

At the appearance of the pseudo-op .8LK the counter is incremented 
by the value of the argument of the pseudo-op (§12). 

The period, whan used alone, is a special symbol whose value is equal 

to the current contents of the location counter. Thus 

LOA 3,. +6 

is equivalent to 

LDA 3,6,1 
If the instruction is assembled at location 1215, it is also equivalent 
simply to 

LOA 3,1223 

8 COHMEIiTS AMD STATEMENTS 

As previously mentioned, a source program can be regarded as one long 
character string. Except for redundant carriage returns, tabs, spaces and 
commas, every character in the string either is part of a comment or state- 
ment, or terminates a comment or statement. 

A comment is not really part of the source program because it cannot 
affect the generation of the object program. Its only function is in 
conjunction with the source program I ist ing--a comment presumably explains 
something related to the portion of the program where It appears. A semi- 
colon as a statement terminator or as the first significant character 



following a statement or comment terminator indicates the beginning of a 
comment; the comment terminates with the next carriage return. Any 
character except carriage return, but including semicolon, can be used in 
a comment. Of course a control character produces no printable output--it 
has its given effect (if any) on the listing device at the point that it 
appears. (Remember, a form feed is executed prior to the line in which it 
appears and cannot be part of a comment) . 

Statements in the assembly language correspond to the statements or 
sentences in a natural language. A statement either defines a symbol, 
generates a word in the object program, or supplies information to the 
assembler. The next three sections describe the four types of statements: 
label statements, equivalence statements, storage word statements, and 
pseudo-op statements. 

A statement terminator is a character that ends a statement but is 
not itself part of the statement. Ho character is used to indicate the 
beginning of a statement. Instead a statement is assumed to begin with 
the first significant character that follows a statement or comment 
terminator, provided this character is not a semicolon (which indicates the 
beginning of a comment). '\ statement that contains a single undefined 
symbol termiri.ited by a colon is a label statement. Every other statement 
is terminated by o semicolon or a carriage return. An equivalence state- 
ment begins with an undefined symbol followed by an equal sign, a pseudo-op 
statement begins with a pseudo-op. A statement that is none of the above 
.s taken to be a storage word statement, and the assembler inspects the 
first nontransparent atom in it to determine the type. If it begins with an 
integer or a purely numeric symbol, it is a data statement and can contain 
only one expression; if it begins with an instruction mnemonic or equivalent, 
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it is an instruction statement and the number of expressions it may contain 
depends upon the instruction class to which it belongs. In determining the 
structure of a statement or evaluating it, the assembler ignores all spaces, 
tabs and commas that immediately precede the statement or its terminator, 
or precede or follow an equal sign. ■ : - ■ 

3 SYMBOL DEFINITION 

A symbol is said to be defined if the assembler has a value for it. 
The value of a numeric symbol is the l6-bit number it reoresonts; the value 
of an operational symbol is its meaning. Some symbols, such as the instruc- 
tion mnemonics, have both numeric and operational properties, for such a 
symbol to be defined the assembler must both have a numeric value for it 
and also know its meaning. All symbols that appear in a program must be 
defined. The initial symbols are predefined and hence already have values 
at the start of the assembly. The source program can define a symbol as a 
symbolic address by means of a label statement, as a numeric symbol by 
means of an equivalence statement, or as a symbol that may have both numeric 
and operational oropertias by neans of certain pseudo-op statements. 

The assembly of a source program is done in two passes, ie the assembler 
goes through' the entire character string twice. The first pass locates the 
entire program and determines the definitions of all symbols. Hence the 
assembler must be able to evaluate oil symbol -defining statements. in the- 
first pass. This means the source program cannot use a pseudo-op or 
equivalence statement to define A as a function of the symbol B unless the 
statement that defines S appears earlier in the source program, in order 
to define all symbols and locate the program, the assembler must also be 
able on the first pass to evaluate all statements that indicate how integers 
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are to be Interpreted or that altar the normal consecutive sequence of the 
location counter. Hence the assembler must be able to evaluate any expression 
that appears in a radix, location or blocl< pseudo-op statement. If two 
or more stataments define {ie assign values to) the same symbol, every 
occurrence of the symbol Is flagged as multiply-defined (M) • 

Mb part uT tnu dsscmuit;! a i u i l i c- i i to u i vjh , i u o»-«.w ~- -■•- , 

size of the configuration in which it is running. This enables one 
version of the Assembler to run in all memory sizes efficiently, building 
its symbol table upward until the memory capacity is reached. An attempt 
by the program to 'define more symbols th.an the assembler can accomodate 
in the area of core set aside for them results in ;( symbol table error (S) , 
and the assembler will accept no more symbol definitions. 

The assembler evaluates all other statements in the second pass. Any 
symbol whose value is not known to the assembler when it is encountered in 
the second pass or in an expression th^t must be evaluated in the first pass 
is flagged as an undefined symbol (U) . A symbol whose value in the second 
pass differs from its value in the first pass is flagged as a phase error (P) . 

10 LABEL AND EQUIVALENCE STATEMENTS 

Only numeric values can be -assigned to symbols by label and equivalence 
statements. These statements are evaluated in the first pass and must be 
used to assign values to symbols that are not d.-;fined elsewhere. 

A label statement follows a carriage return or colon, consists of 
one symbol that has not been defined previously in pass 1, and is terminated 
by a colon. The statement defines the symbol, and its value is taken from 
the current contents of the location counter. Ordinarily a label statement 
is used in conjunction with a storaqe word statement. If the latter 
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Immediately follows the former, the label provides a (symbolic) address for 
the memory location that will receive the storage word when the object 
program is loaded. If the storage word statement 

LOA 2,30 
is immediately preceded by a label statement, say 

LOOP: 
^■Q If the coding is 

LOOP LDA 2,30 
or equivalent, then the storage word statement 

JMP LOOP 
is assembled to produce a jump to the same location that receives the storage 
word LDA 2,30 (provided of course that location LOOP is in page zero or within 
range of the location containing the JH? LOOP (sec; 511)). A previously 
defined symbol terminated by a colon Is recognized as a label statement, and 
the symbol is redefined and flagged (■') . A label -type statement containing 
other atoms besides an undefined symbol is flagged as a colon error (C) . 

An equivalence statement follows a carriage return or colon and uses an 
equal sign to define the symbol at its left by assigning to it the value of 
the storage word statement at Its right. These f^r^ all legal equivalence 
statements. 

A = 3^2 
3 »A/2 

C= SZC+17-A/11-B 
= LDA 2,350,3 
E=A0D2-$MC 
The symbol at the left must be previously undefined in pass 1, and the 
statement at the right must be capable of evaluation in poss 1, ie any 
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symbols in it must already have boon defined (on undefined symbol is 
flayged as a equivalence orror (E)). The statement at tha right of the 
equal sign is not really a storage word statement in that the assembler 
does not actually generate a storage word from it, but it must be recognized 
by the assembler as equivalent to such a statement. Hotc that in the last 
example, the statement at the right is recognized as s storage word state- 
ment for a format error (F) but would assemble it correctly, ie would assign 
the actual value of the expression at the right to the symbol at the left. 
An equivalence statement terminates with the first semicolon or carriage 
return, but any expression following a complete storage word statement 
after thv". equal sign is ignored. Any number of tabs, spaces or commas 
at either side of the equal sign are also ignored. 

iJeither a label statement ncrc an equivalence statement has <?ny effect 
on the location counter. Beginning at location 1322, 

LOA 1 , . 

LDA 2,. 
B; C; 
D: LDA 3,. 

A 
is assembled as equivalent to 

LDA 1,1322 

A=1323 

LDA 2,1323 
B: C: 
D. LDA 3.132^ 

1323 



w 



hf>.n 3. C and D are all asslaned the value 1324. 
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11 STORAGE WORD STATEMENTS 
A storage word statement generates the output for one word to be 
stored as part of the object program. Except for a text or end pseudo-op 
statement, only this type of statement actually produces output, although 
other types can affect the value of the 16-bit word produced. The following 

135602 

IS2 §A+C,2 

STA 3,D 

CnM# M,SZR 

A+3/C'VD 

@3720+E 

DIAG 2,PT;^ 

HALT 
The current contents of the location counter designate the memory 
location that is to receive the word when the object program is loaded. 
The counter is incremented every time a storage word statement is processed, 
so the words generally are assigned to consecutive locations unless the 
counter is changed by a location or block pseudo-oD statement. 

A statement that is not a label statement and does not contain an 
equal sign or a pseudo-op is assumed to be a storage word statement 
that is terminated by the first semicolon or carriage return. The assembler 
examines the first nontransparcnt otom in the statement to determine the 
type, and hence the maximum number of expressions or fields the statement 
can contain md the minimum number it must contain. A statement with fewer 
than the minimum or more than the maximum is fl-t^gged for a format error (F) , 
but the assembler ignores any expressions beyond the maximum allowed in it. 
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A transparent atom can appear anywhere from the first character to the 
last before the terminator, when one is used, the assembler first 
evaluates the statement without it, and then adjusts the result for the 
special atom by ORing a 1 into the appropriate bit (hence it has no effect 
if the bit is al ready 1) . 

Data 
If the first nontransparent atom in a storage word statement is an 
integer or a purely numeric symbol, the assembler takes i t as a data state- 
ment containing a single expression. In the above examples the first, 
fifth and sixth are data statements. !n such a statement the special 
atom @ can be used In generating a full word indirect address. Since it 
has its effect after the statement is evaluated, all of these data state- 
ments have the same value, 

1026i4i» 

102644© 

2644@ 

1322*2 3@ 
although the last one is flagged for a format error. Remember that the 
special atom. is neither an integer, a symbol, nor an operator, and there- 
fore cannot be part of an expression. Hence either of these, 

100000@ 

@0 
is a data statement whose value is 100000, but this, 

is not. In other words a storage word statement must contain at least one 
expression — § by Itself does not suffice. 
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A statement being taken as a data str3temcnt docs not mean that the 
object word gLnerattd by it is ncces3~rily an operand in the progran. A 
data statement is simply a way of representing a 15-bit vi^ilue; it need not 
be used as an operand anymore than a number generated by an instruction 
mnemonic need bo executed as an instruction, ^'p' the first example of a 
storage word statement given at the beginning of this section, 

135^'' 02 
if executed as en instruction would be equivalent to 
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lyis t-vxio t-i ons 
If the first nontransparent atom in a storage word statement is an 
instruction mnemonic (or equivalent), the assembler tp|<es it as 5n 
instruction statement, determines the class to which it belongs (mer^ory 
reference, with or without accumulator; arithmetic and logic; input-output, 
with or without sccumulator) and therefore the number of fields in it. 
In ganeral an instruction statement is made up of a mnemonic field followed 
by a number of argument fields, the standard procedure is to separate the 
argument fields from the mnemonic field by a tab and the argument fields 
from each other by commas, but since tib; sp^-cz and comma are equivalent, 
they C3n be used arbitrarily as field secarators. Every field is ?n 
expression which is evaluated in the normal way. The mnemonic field should 
be simply the mnemonic, which specifies only certain bits in the instruction 
word, but results in a lo-bit value {eg X.\Zll is evaluated as 103530). 
A mnemonic field containing more than just a mnemonic is fl-igged as a 
format error but is evaluated correctly. In any ovent the expression must 
begin with a mnemonic: so one could give 

ADDZL+400 
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which (although flagged with an F) is equivalent tc 

lOBUO+ijOO 
ie to ANDZL. The argument fields represent other parts of the instruction 
word, such as an accumulator address or a sl<ip function; and their effects 
arc limited to those parts of the word-"each argument field is evaluated as 
a 16-bit number, but the assembler evaluates the total statement by tc?l<ing 
only the necessary low order bits from each argument value and O^.ing them 
into the appropriate bits for the total statement value. If the statement 
contains a transparent atom (which is not regarded as a field even though it 
represents a specific part of the instruction word), its value is ORcd into 
the result after evaluating all fields. 

Although the assembler masks out unnecessary bits in the values of 
field expressions, it flags as a field overflow error (O) any AC address or 
index field whose value is gr.-.?ter thir, 3, any skip field whose value is greater 
than 7, and any device field whose value is greater than 53. The assembler 
flogs as a format error (F) any instruction statement that contains a 
transp^-^rent atom '-,'hen none is allowed. An overflow error also results if 
any argument field requires th_ assembler to place a nonzero number in any 
field of the storage word that has already received a nonzero number due to 
the evaluation of the mnemonic field; ea except for incorrect format in 
the second, th-se two statements are equiv:i'li::nt 

AND 0,2,SKP 

ANO+1 0,2 
so this statement results in both format and overflow errors: 

Ai^D+1 0,2,SKr 



Memory Reference. A statement for an instruction that re;forenccs 
memory has one or the other of these forms depending upon whether it 
requires an accumuJator. 

f^tnemonio Address, Index (optional) 

Mnemonic Aaaumulatorj Address, Index (optional) 

The mnemonics for these two forms arc -.'s follows. 

Without Accwrtulator With Accumulator 

Jf^P LDA 

JSR STA 

ISZ 

DSZ 
The transparent atom @ may be used to indicate indirect addressing; it is 
usually ol.3Ced immedigtaly before the address field. 

The index field determines the action the assembler takes with respect 
to the address field. Lot L be tb- current value of the location counter, 
A the value, of the address field, and X the value of the index field. 

Index Action 

or Blanl< (te I P .4 < 377, piece 00 in bits 6-7 and a "" bits 8-15 
no expression) {oajo zero addressing). If i - 200 <^ A <L + 177, 

place 01 in bits i-1 and A - L ]r\ bits 8-15 (relative 

^ddrv,r,s ing) . 
1; 2. or 3 If -200 ^A <_ 177, pl'^cc X in bits t-J and a in 

bits 8-15. 
If th(j condition associated v/ith a given index value is not satisfied, the 
assembler flags the statement for an address error (A) and places the low 
order bits in the di solacement and index parts of the instruction word as 
shown in this flow chart. 



2h 




00-vBITS 6-7 
/1->BITS 8-15 



NO 



NO 




YES 




NO 



0K2ITS 6-7 

A-i>BITS8-15 



y 



FLAG STATE- 
MENT FOR ADD 
RESS ERRORS 




Z-^8ITS 6-7 
;1^&ITS 8-15 



Arithmetic and Logic. A statement for an instruction in the arithmetic 

end logical clasps has this form 

I'nemomc Source AC^ Destination AC, Skip Function (optional) 

The instruction end skin mnemonics for this class are as follows. 

Skip 



Instruction 




f L 


KOV 


R 


ADC 
ADD 


r 1 c 


A- 10 ^ 





SK.F 
SZC 

sue 

S.7R 

s;jR 

SFZ 
SSN 



The transparent atom * may be used to inhibit the processor from loading the 
instruction result into the destination accumulator; it is usually placed im- 
mediately after the instruction mnemonic. 
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Input- output, A statement for an instruction in the in-out class has 
one or the other of these forms depending upon whether it requires an 
accumulator. 

Mnemonic Device 

i^-fnemonic Accumulator;, Device 

The mnemonics for these two forms arc as follows. 

Without Accumulator With Accumulator 

S 
C 



NIC Jc 




SKPSM ncB / )^ 

SKPBZ i:>ic ^° 

SKPDH DOC J 

SKPDZ 

Special Mnemonics. The assembler ?ilso recognizes some special 

instruction mnemonics which combine t basic in-out code with the central 

processor device code. This elimin-ites the need for a device field, and 

two of them even eliminate the usuclly required accumulator field. 



Special 

llnemonic 


Equiva 


lent 


Required 
Arguments 


READS 


niA 


O.CPU 


Accumulator 


lORST 


Dice 


0;CFU 


Mone 


'HALT 


DOC 


0,C''U 


^Jone 


IMTEN 


NIOS 


CPU 


'ione 


h^iTDS 


M 1 OC 


CPU 


?'one 


INTA 


Die 


O.CPU 


Accumulator 


",SK'^ 


DOB 


O.CPU 


Accumulator 



Hence the assembler recognizes 

READS 3 

as a storage word statement equivalent to 

DIA 3, CPU 
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and ignores anything after one argument fiold in a statement beginning 
with READS. 

The pscudo"Ops discussed in §13 allow th-., programmer to define symbols 
that will then be accepted by the assembler as equivalent to instruction 
mnemonics . 

Floating Point Instructions. The programmer must remember that it is 
illegal for him to define symbols that aro identical to the initial symbols 
The initial symbols include the instruction mnv^monics listed in Appendix D 
of How to Use the Nova and also the floating point instruction mnemonics 
that are assembled like ordinory instructions but for execution by an 
interpretive routine. The floating point mnemonics are explained in the 
writeup of the Floating Point Interpreter program and are listed below. 

Instruction Mnemonics Option linemonias 

FADO FIfll FSGE 

PALG FISZ FSGT 

FATN FJMP FSKP 

FCOS FJSP FSLF 

FDFC FLD3 f'SLT 

FDFCI FLDA FSNR 

FDIV FMCV FSZR 

FDSZ FMMS 

FETR FiU'Y 

FEXP FNE'^ 

FEXT FPOS 

FFDC F^MD 

FFOFF FSIM 

FFIX FSQR 

FFLO FST3 

FHLV FSTA 

FIC2 FSUD 

FIC3 FT AN 
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12 3ASIC !^$EUnO-OPS 

A pseudo-op is a purely opcr^ticngl symbol. Such symbols arc commands 
to thv? assembler rather than symbolic representations of numbers; they can 
affect the internal operation of the assembler, generate portions of the 
object program, and define symbols (the last type is treated in the next 
section), "os t pseudo-op statements h?vc the form of a pseudo-op followed 
by one argument. 

Radix. At the beginning cf each pass the assembler stnrts by intcrprot imj 
integers as octal. The source program can change the radix by giving a 
statement of the form 

• ^^OX Expression 
where integers in the expression =re alvcys interpreted as decimal. The 
value of the expression becomes the novv radix for integer evaluation. If 
the expression cannot be evaluated in pass 1 or its value is less than two 
or greater than ten, the assembler flags the statement for a radix error (D) 
and continues to use the previous radix. 

This example of source coding illustrates the effect of the radix 
psGudo-op on the octal values of exoross ions . 
Location Value Statement 





000002 


.RDX 2 




00000 


000037 


ioi:ii;ni 


;5 ORE 




000003 


."xDX 3 




00001 


300013 


21 + 11 


■J+k 


00002 


000006 


12'^- 12/ 11 


;5x5^'t 




000012 


.■'DX 10 




00003 


000115 


77 




OOOOij 


000077 


63 




00005 


000037 


3^^8/3+7 
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Location. "hcni::Vcr the ssscnbl-r ..ncount.jr5 •■? statement of the form 

.LOC Expression 
it sets the location counter to the value of the expression. If the 
expression cannot bu evaluated in pass 1 or its value exceeds 32;7^'7^q 
{ie produces sn iddross of nore then fifteen bits), the assembler flags it 
for c location error (L) -^nd ignores the statement. In other words an 
erroneous location statement his no effect on the location counter. 

When resetting the counter, the programmer should be careful not to 
produC'^ an overlap, as in 



Location 








002 U; 




13 KO 


ri 


00215 




I ['1 A 


n, TTSAV 


002 ir 




IMTP,! 




00217 




JMP 


@TTSAV+1 


00220 


TTSAV -. 







00221 




C 




00222 




•y 






.LOC 


220 




GC220 


PTFIi: 


?TA 


^,PP?AV 


00221 




ST.". 


1 ,PPSAV+1 


00222 




'■-fx 


2,PPS/'V+2 



The assembler would '::5sign the zero vjords to locations 220-222. But 
resetting the counter to 220 causes the next three instructions to be 
assignee to the same three, locations with no error diagnostic, ''hen the 
object program is loaded the zero datv- words ar.- loaded first, but arc 
replaced ty th^^ STAs when thuy are loaded. Furthermore, as soon as tho 
program saves something in the TTSAV locations, the STAs are destroyed. 



A location statement can bo used to reserve o block of storage. The 
following example allocates a block of twenty locations for a table wherein 
the first location in the table is labeled TAB20, and the first location 
after the table is labeled TEMD. 

TAB20: .LOC .+2^ 

TEHO: 

Block. This pseudo-op Is used explicitly to allocate blocks of 
storage. At the appearance of a statement of the form 

. BLK Expression 
the assembler incrrments the location counter by an amount equal to the 
v?luc of the expression. A location error (L) results if the expression 
csnnot be evaluated in pass 1 or its value when rddod to the current value 
of the location counter exce^-ds 2'^ - ]. 

This line of source coding reserves a block of six words starting at 
location BLK^ . 

SLKo. .3LK 2*3 
End of Tape. It is sometimes necessary to continue a program onto 
another source tape. Upon encountering the pseudo-op 

.EOT 
the assembler stops the source input device and halts with OOOOo in the 
address lights. The assembly can be continued by loading a new tape and 
and pressing the console continue switch. 

End. The final statement "" a source program must be one of these, 

.EMD Expression 
.END 
and the line in vjhich it appears (including a comment if any) must be 
terminated by a carriage return. If the pseudo-op has an argument, its 
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value is taken as the starting address of the program just assembled; 
after reading in the object tape, the loader automatically starts the 
execution of the program at the location given. If there is no ?rgument, 
the loader halts after loading the object program. 

Caution 

An end statement rrust be followed by a aarriage retuxm. 

Omission of this character causes the assembler to act as 

though it had encountered an end of tape statement instead. 

It will thus halt and writ for further action by the operator. 

Text. To store the octal codes for a string of characters packed two 
to a word, the programmer can use the text pseudo-ops. The basic text 
statement is of the form 

.TXT E,text string Z 
whore 5 is any character other than carriage return, space, tab, comma, 
null, line feed, form feed or rubout, and which does not appear in the 
text string. Upon encountering the pseudo-op .TXT, the assembler takes 
the next significant character rther than a carriage return as the text 
delimiter, and then assigns succeeding pairs of characters to consecutive 
memory locati-ons until it again encounters the delimiter. If the string 
contains an odd number of characters, the final one is paired with a null 
character; if an even number, a null word is assigned to the location 
following the string. This provides a convenient means for an output 
routine to detect the end of the string. 

As usual, null, line feed, form feed and rubout are not regarded as 
elements in the statement; but from the timf the assembler encounters the 
first delimiter until its second occurrence, carriage returns are also 
ignored (of course a carriage return preceding the first occurrence 
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terminates the statement). Hence when the programmer is preparing an 
input tape on an ASR, he cr-.n continue the text string onto additional 
lines on the teletype p^ipcr without introducing spurious carriage returns 
into the statement. 

The programmer can introduce any character, v-^vcn a rubout, into the 
text string by enclosing nn expression for it in angle bracl<ets; hence 
angle bracl<ets cannot themselves ippcar -:s characters in the string. Upon 
encountering e left bracket., the -rissembler evaluates the expression contained 
between it and the next right bracket, -md takes the low order seven bits 
of that value as the ASCII code for the next byte to be packed. Thus to 
store the sentence 

GO TO <h'> 
the programmer can give a text statement of the form 

.TXT §00 TO <7i»>lh:<76>@ 
or, if he cannot remember the codes, 

.TXT §G0 TO <''<>i;'j<">>§ 
The example just given v/ould appear in the listing this way. 

.TXT fflf^O 
T 




The assembler generates 8-bit bytes, rrade up of the 7"bit ASCII code and a 
leftmost bit of 0, and packs them from right to left in the storage words. 
Our example would thus produce the words 
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OG 0^*7507 

T 0320itO 
020117 
|< O^'j'47^ 
>M 03711^ 
00 003000 
Altogether there are four forms of the text pseudo-op that very the 
disposition of the leftmost bit in the 8-bit bytes generated from the text 
string. 

Fseudo-op Effect on Left Bit 

.TXT Left bit is 0. 

,TXTO Left bit is odd parity fcr the byte. 

.TXTE Left bit is even parity for the byte. 

.TXTF Forces left bit to be 1. 

The assembler initially packs text bytes from right to left unless the 
programmer gives a text mode pseudoop. After the .ippesrance of the statement 

.TXT'^ Expression 
with a nonzero expression, the assembler uses left-right packing for any 
text string it encounters. The programmer can switch back to right-left 
packing by giving .TXTM with ?. zero argument. Eg in this sequence, 

.T;'T^< 
.TXT /'•/ 
.TXTM 1 
.TXT /;./ 
the second statement generates the storage word 000101, the fourth generates 
G^iO'^OO. 
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13 SYMBOL TABLE PSEUOO-OPG 

By using certain pseudo-ops the programmer can define symbols I il<e 
the special instruction mnemonics (such as ITEN and MSKO) , ie symbols 
that include an instruction mnemonic and other fields of an instruction state- 
ment as well. The genera! form of a symbol -defining pseudo-op statement is 

Vseudo-o^ Equivalenoe Statement 



ve 



Pseudo-op Undefined Symbol = Storage Word Statement 

The simplest of these pseudo-ops, .DUSR, defines symbols which retain 
no operational properties; in other words it acts just like a simple 
equivalence statement insofar as the value of the symbol is concerned. Defin- 
ing CNT by 

.DUSR Cf!T = 2'* 



means that 



is equivalent to 



STA 2 , CiMT 



STA 2,2'* 



Simi larly 



allows us to give 



DUSR RDR = DIAS 0,pTR 



RDR 
to bring in 3 character from the reader to ACO and start the reader again. 
But we cannot give 

RDR 1 
A symbol defined by .DUSR has no operational properties and therefore can 
take no arguments; the last example would be flagged for a format error (F) . 
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The other six pseudo-ops of this tyoc dcfino symbols with operational 
properties. Typically each pseudo-oD f3llows the programmer to define a 
symbol as equivalent to an instruction statement in which certain fields 
3re zero. The symbol is then used with arguments corresponding to the 
zero fields in the definition. Suppose we often have to compare the 
magnitudes of unsigned numbers in the accumulators. We could use .DALC for 
these def ini tions . 

.DALC SL = SU3Z# 0,0, SZC 

.DALC SLE = ADCZ# 0,0, SZC 
Then 

SL x,y 
skips if hZy < ACx, and 

skips if ACj/ <_ ACa-. In other words these newly defined symbols act just 
like instruction mnemonics in the -jrithmetic and logical class. The 
number of arguments given with the symbol plus the number absorbed in it is 
equal to the number tha original mnemonic takes. With a symbol defined 
by .DALC, a skip field is optional if none was given in the definition. 
The effect of a transparent atom can accompany every use of a symbol by 
giving it in the definition, or it can be given at the programmer's 
discretion when the symbol is used. 

Now even though a syrrbol defined by .DALC has certain operational 
properties (specifically taking certain arguments), the storage word state- 
ment in the definition need not have any. Hence 5L could just as v;el1 be 
defined this way: 

.DALC SL = 102432 
and i-t would still require two accumulator arguments in use. Also, the 
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argument fields rcqulrad in use need not be zero in the def in! tion"--thc 
restriction is that the programmer must not attempt to put a nonzero 
quantity in the same storage word field twice. Thus to have a convenient 
symbol for testing whether AC1 is less than some other accumulator, we could 
define TEST by 

.DALC TEST = SU3Z# 0,1, SZC 



or squivalently 



• DALC TEST = 1C6i.32 



rvy^ 



TEST 2,0 
skips if AC1 < AC2. liote that the required arguments must be given even if 
zero. 

The table on page 37 lists -11 -f these ps .ULC-rps , th.. tyn.s -f 
symbols they can define, and for onch type, the arguments that must 
accompany the symbol when it is used. Optional elements are indicated by 
square brackets. 

The programmer can specify certain parts of ALC and 10 instruc- 
tion words by oopending letters to the basic three-letter 
mnemonics for these instruction'-. This property is retained 
by the equivalent symL'.ol types defined by the pseudo-ops 
discussed here. Eg if the orcgrammer uses .DALC tc define a 
symbol whose fourth character is I ., whenever the assembler 
encounters a stateme.-it in which that symbol is used, it will 
place Is in bits 8 and 9 of the storage word generated from 
the statement (just ^is it would if the programmer used ADDS 
or f.'E'^S) regardless of the value assigned to the symbol by 
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the .DALC statement. Hence unless the orogrammcr actually 

wishes to us^ this function of the assembler in generating 

storage words, he should avoid the following: 

Using L, R, S, Z, or C as the fourth character in a 
symbol defined by .DALC, 

Using L, R or S as the fifth character in a symbol 
defined by .DALC and whose fourth character is 7., 
or C , 

Using S, C or P as the fourth character in a symbol 
defined by .DIO or .DIOA. 

Conversely if the programmer limits his symbols of these types 
to three characters , he can append the above letters to them to 
produce the same effects as with ALC and 10 mnemonics. (In 
fact the instruction mnemonics are not built into the assemblcr-- 
they arc defined by oseudo-ops.) 
Although symbols defined by .DUSR take no arguments, there is one 
property that all symbols defined hy these pscudo'Ops h^vc in common and 
that differentiates them from symbols defined by label and equivalence 
statements. All symbols defined by pseudo-ops become initial symbols, ie 
they become initial entries in the symbol table and can be used without 
definition by programs that are .-issemblcd after they ere defined. (This 
also means that a lat^^r program cannot use the same character string for 
come other purpose, eg as a label.) These symbols remain in the symbol 
table until the assembler is reloaded or the programmer expunges the table, 
Reloading the assembler reduces the tabic to its initi^^l state, in which 
it contains only the instruction mnumonics -■'nd the permanent symbols, ie 
the special location counter symbol (.) and the pseudo-ops. Hiving the 
pseudo-op / 
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undefines all but thv?. permanent symbols and recovers the space used by 
the expunged definitions. After expunging the table, the programmer can 
even define instruction mnemonics such as ADD and JMP in any way he wishes. 



SYMBOL OEFINliJG PSEUDO-OPS 



Symbol Type 

Pseudo-op Defined Arguments in Use 

numeric) any expression) 

. DMR i'iCmory reference l.fSjAddress L, Index J 

.DMP.A Mi,nory reference AC, C@]Address L, Index J 

with AC 

.DALC Arithmetic and [*JaCS, ./'CD [,Skin3 

logical class 

.DIG In-out Device 

.DIOA In-out with AC AC, Device 

•DIAC Instruction 

AT /;« KIt-c- •} s^A ii\ 
r\^ \»ii wiuj jf ciiu -1/ 
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14 OPERATING PROCEDURE 
To assemble a source program it is first necessary to load the object 
tape of the assembler (a tape is includerf in the standard ilOVA software 
package). Once loaded, the assembler automatically takes control and 
prints requests for various parameters on the teletype. The programmer 
supplies the necessary information by typing numerals back. 
The asseribler first types 

\\: 
in response to which the programmer identifi:-s the source input device by 
typing one of the following numerals. 

1 Teletype reader without parity checking 

2 Teletype reader with parity checking 

3 Paper tape reader without parity checking 
k Paper tape reader with parity checkinq 

5 Teletype keyboard without parity checking 

When parity is checked the assembler substitutes a backward slash {^) for 
any incorrect character and flags the line containing it for an input error 
(I). 
Mext 

LIST: 
requests the programmer to select the device on which the assembler is to 
list the source program. 

1 Teletype ASP33 (tabs and form feeds simulated) 

2 Teletype KSR3'^ 

3 Line printer 

k Paper tape punch with tape prepared for later 

1 isting on an AjR33 
5 Paper taps punch with tape prepared for later 

1 icf in.-! nn An A^R^^ 



After 

is typed, select the output device on which the object (binary) tape is 
to be punched, 

1 Teletype punch 

2 Paper tape punch 

The above responses identify the 10 devices to be used during 
assembly, and at this tine tiie source tape should be mounted on the 
selected input device. Tho essombler types out 

'"ODE: 
to determine what function to perform during the upcoming pass. 

1 Poss 1 (all symbols are defined) 

2 Pass 2 - Output an object tape 

3 Pass 2 - Output a listing (including an alphabetical 

symbol 1 ist) 

k Pass 2 - Output both an object tape end a listing 

5 Output an alphabetical symbol list 

Note that k is illegal if the orogrammer selected the same device in 

response to both BIN: and LIST:. \Jhen a pass is completed, the assembler 

again types ■ 

MODii: 

to reauest the next function to be performed, if any. 

If it is necessary at any time to select c. new 10 device, do the 

fol lowing: 

1 . Press RESET 

2. Set Q00002 into the data switches 

3. Press START 
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To reassign the mode, do this: 

1. Press RESET 

2. Set 000003 into the data switches 

3. ""ress START 

To save the symbol table {eg because new initial symbols have been 
defined), punch a ntw object tape of the i3ssembler itself after pass 1 as 
fol lows . 

1. Perform pass 1 on tho defining tape. 

2. V'hen the assembler finishes pass 1 it types out ''MODE". Respond by 
typing in '1 '. This causes the assembler to eliminate noninitial entries 
from the symbol table, and it then stops since there is no source tape in 
the reader. 

3. Using the Binary Punch Program iqv) .. punch the tape from location 
000002 to the location addressed by the contents of location 000004 
(location 000004 addresses the last location in the symbol tabic). 

4. Specify 000002 as the assembler start address to be punched in the 
start block at the end of the tape. 



APPENDIX A 
CHARACTERS 



Character 



7 Bit 
ASCI I 



Null 000 

Horizontal Tab Oil 
Line Feed 012 



Character 



5 

6 



Form Feed O^k 7 

Carriage Return 015 8 

Space 040 5 

: Oi*1 : 

042 ; 

# Ok3 < 
& 0A6 

" 052 > 

+ C53 @ 

054 A 

055 B 

056 c 
/ . 057 D 

060 E 

1 061 F 

2 062 G 

3 063 H 



7 Bit 
ASCII 



064 
065 
066 
067 
070 
071 



Character 



I 

J 
K 
L 



7 Bit 
ASCII 



072 

073 P 

074 Q 

075 R 

076 S 

100 T 

101 U 

102 V 

103 W 

104 X 

105 Y 

106 z 

107 Rub Out 177 
110 



111 
112 
113 

114 
115 
116 

1 1/ 
120 
121 
122 
123 
124 

125 
126 
127 
130 

131 

132 



APPENDIX B 
PSEUDO-OPS 



Mnemonic 




Effect 


.BLK 


Assign 


a blocl< of storage 


.DALC 


Define 


an arithmetic and logical Instruction 


.OIAC 


Define 


an instruction requiring an accumulator 


• DIO 


Define 


an input/output Instruction 



.DIOA 

.DMR 

.DMRA 

.DUSR 

.EHD 

.EOT 

.LOC 

.RDX 

.TXT 

.TXTE 

.TXTF 

.TXTM 

.TXTO 

.XPNG 



Define an input/output instruction requiring an accumulator 

Define a memory reference Instruction 

Define a memory reference Instruction requiring an 
accumulator 

Define a user symbol 

End of source input 

End of tape 

Assign a location counter value 

Change the number radix 

Define pacl<ed test strings in octal--force parity to 

Define packed text strings In octal --compute even parity 

Define pacl<cd text strings In octal--force parity to 1 

Define text packing mode 

Define packed text strings in octal --compute odd parity 

Expunge all but the permanent symbols from the symbol 
table 



,'^PPENDIX C 
SYMBOL TABLE 

All predefined and user defined symbols are entered in a table called 
the symbol table. This table is origined at the end of the assembler and 
is upward expandable until the memory capacity of the machine being used is 
exhausted. Each entry in the table occupies three l6-bit words. The maximum 
length of a stored symbol is five characters and is represented In radix 50n 
form. This method uses the first word to store the first three characters 
of the symbol and eleven bits of the second word to store the last two 
characters of the symbol. The five remaining bits of the second word are 
used to define attributes of the symbol, eg, a memory reference Instruction 
symbol. The third word is used to store the numeric value of the symbol. 

Symbol table capacity for 2 ^K system is approximately ^00 symbols. 

Radix 50 representation is \is^i to condense symbols of five characters 
into two words of storage using only 27 bits. Assume a symbol of the form; 

■^Zj °'3 ^2 "1 "^0 
01 ! mav be a - 7 (')f-\ 

0-9 (10) 
or . (1 ) 
All symbols are padded (if necessary) with nulls. Therefore, there an 
38 = k(:Q possible characters. Each character can be translated as follows: 



10 8 



character (a.) translation (g.) 
I I 

Null 00 

to 9 1 to 12 

A to Z 15 to kl^ 

45 



C2 

If a. tr^jnslatcs to g., wt can compute the following numbers: 

M^ = {{&l^ * 50)+63)"5G+62 
M^ = (e^ =^ 50)+3o 

H maximum is (50)^ -1 which equals I?'*??? and will take a maximum of 16 bits 
to represent. M^ maximum is (53)^ -1 which equals 3077 and will take 11 bits 
to represent. The symbol is thus represented by J!^ and H^ which take 27 bits 

of storage. 

A number of symbols exist which are permanently defined in the assembler. 
They cannct be eliminated by the .XPMG pseudo-op. These symbols are: 

.BLK .DMRA .TXT 

.OALC .DUSR .TXTE 

.DIAC .END .TXTF 

.DIO .EOT .TXTM 

.DIOA .LCC .TXTO 

. om . RDX . XPHG 

These symbols will never apoear in the symbol list following rnn assembly 
listing. Mete that a second class of symbols exists (initial symbols) which 
have been entered in the symbol table by the operator defining pseudo-ops 
(§12). All of the MOVA instruction mnemonics src in this category. They 
are never printed in the symbol list following an assembly listing. They 
can be eliminated, however, by using tho .XPMG pseudo-op. Care must be taken 
not to confuse this second class of symbols with permanent symbols when 
using the .XPMG pseudo-op. 



APPENDIX D 
ERROR MNEMONICS 

Extensive examination of statement syntax takes place during both 
passes of the assembly in order to detect syntactic errors in the input. A 
statement found to be in error will be flagged with from one to three letters 
indicating general classes in which the error(s) fall. Statements in error 
during pass 1 will be printed (with flags) on the teletype. Pfter pass 1 
the user may decide whether to continue to pass 2 or to correct any errors which 



mOVc occurrGu luUS isr. jtateiDents in error uurinij pass ^ wi ii ue printer on 
the teletype as well as flags appearing opposite the statements on the list- 
ing device ( i f any) . 

An alphabetical list of error codes along with examples of statements 
causing such errors is given on the next page. 



D2 



ERROR 
FLAG 


GENERAL CLASS 
OF PROBLE^i 


EXAMPLES - ( 


:owiEr!TS 




A 


Addrass error 




LDA 
ISZ 


0,^400 
.+317 




B 


!iad charactsr 


LA$L 


LDA 


1,23 


; $ ^JOT PE:1?',ITTED 


C 


Colon error 


A+2 : 






, MO EXPRESSION PERMITTED PEFOP.E 
; A COLOM 


D 


?,^dix error 




.R.DX 


12 


HAD IX 12 MOT PERMITTED 


E 


Equal error 


P,EG= 


3+B 




; WHERE D IS UHDEFIflED 


F 


Format error 




ADD 


2 


MEED AT LEAST 2 OPERAMDS 


1 


Input error 


; PA 


^ITY CHECKED ON \H 


3UT AND SOME CHARACTER. >.'AS \H ERROR 


L 


.LOC error 




.LOC 


-1 


SIT SET 


M 


f'ultiply defined 
symbol 


h: - 


3 
5 




• SYMBOL MAY APPEAR ONLY 
i Oi'.'CE IN LABEL FIELD 


N 


Number error 


C77: 


7A 




■ MO LETTERS PERMITTED IM A 
, !v'UilBE» 





Field overflew 




LDA 


4 . LOC 


MO REGISTER 'l 


P 


Phase error 


; v;. 


LUE OF 


A SYMBOL In 


PASS 1 DIFFERS FROM THAT OF PASS 2 


Q 


Questionable 
line 




. + .E 


:^D 




s 


Symtjol table 
overflow 




MORY c; 

ACHED 


PACITY FOR 


A CIVf..' KACHIHE HAS BEEtJ 


T 


Error in table 
pseudo-OD 




]h+. 


XPi'C 


MP EXPRESSION CEFORE A TABLE 
• f^SEUDO-OP 


U 


Undefined Symbol 


r 


SY:«30L 


IN OPE-;Ar'D 


FIELD VAS NEVER DEFINED 


X 


Taxt error 


LET. 


'X3 
3+.TXT 


• ONLY ONE CHARACTER IN " ATOM 
; MO EXPRESSION PERMISSIBLE 
BEFORE .TXT 



APPENDIX E 
LISTING FORMAT 

I SAMPLE ASSEMLY LISTING 





00000 


024002 


STRTI 


LDA 


\»»*2 








00001 


050000 




STA 


2#.-t 








00002 


157000 




ADD 


2#3 








00003 


014020 




DSE 


20 








00004 


170401 




NEG 


3#2«SKP 








00005 


042524 




.TXT 


♦TE 








00006 


052130 


XT 












00007 


005015 


<15><12 


> 










00010 


000000 

000040 
000002 
000005 


* 

ACNST* 
BCNST« 


40 

• RDX 
101 


e 








00011 


000135 
000010 

• 


CNSTt 


1011101 
• RDX 


8 






A 


00012 


020766 




LDA 


0«400 




ADDRESS ERROR 


A 


00013 


010717 




ISZ 


317«1 




ADDRESS ERROR 


UB 


00014 


024023 


LASLI 


LDA 


4<23 




BAD CHARACTER IN LABEL 


MC 






tK*2t 








MULTIPLY DEFINED AND 
COLON ERROR 


UUF 




REG = 


3 +8 






EQUIVALENCE CRROR 


F 


00015 


1 43000 




ADD 


fi 




FORMAT ERROR 


L 








.LOC 


-I 




LOCATION ERROR 


MP 


0Piei6 


000003 


A)3 








PHASE ERROR 


K 


0Ptei7 


000005 


At5' 








MULTIPLY DEFINED 


N 


00020 


000007 


C77l 


7A 






NUMBER ERROR . 





00021 


020016 




LDA 


4#.-3 




FIELD OVERFLOW 


R 








.RDX 


20 




RADIX ERROR 


T 






2*3«.DUSR 






SYMBOL TABLE ERRO 


U 


00022 


030015 




LDA 


2«B 




B IS UNDEFINED 


X 






3*.TXT*2 






TEXT ERROR 



Q 



.-^•END 



I QUESTIONABLE 



E2 



A 


000017 


ACNST 


000040 


B 


000015 


BCK'i-T 


!?000E5 


C77 


0000SP 


CKST 


00001 1 


LA 


000014 


lAL 


000014 


HEG 


000015 


STHT 


000000 



APPEilDIX F 
OBJECT TAPE FORMAT 



The output of the assembler is an object tape. Its format is acceptable 
as input to the binary loader. The tape is punched in blocks separated by 
null (all 0) characters. There arc three block types: data, start and 
error. The Loader reads two tape characters to form a l6-bit word. The 
format is as fol lows; 



tape channel 
B 7 G 5 <, 3 2 1 



o 
o 
o 



# 1 



# 2 



o 
o 



direction of motion 



78 



15 



# 2 



# 1 



word 



In other words, the first t.'p.; character forms bits C-15 of the data word 
(Channel 8 to bit 8, etc.) and the second tape character forms bits 0-7 
of the data word (Channel S to bit 0, etc.). The first non-null tape 
character signifies thc»t start cf a block. The block type is detc^rmined 
by the first word read. /' description of sach block type follows:" 
Data Block - Bit of first word is e 1. 

word 

I 

2 

3 

k 

5 

wc = n 

3 + n 

The two's complement of trie number of data words in the block is given in 
tht, first word (therefore, bit 3 is a l). \'ormTlly 1$ data vjcrds will be 
punched per data block. However, the .Z'W and . LOC pseudo-ops may cause 



-wc 


sddr. 


.^ss 




chect 


:sum 


data 


wd 


1 


data 


wd 


2 


; 


dst^ 


wd 


n 



F2 

short blocks to be punched. The second word contains the address at which 
the first d3t.3 word is to bi loaded. Subsequent data words are loaded in 
sequentially ascending locations. The third word contains a checksum. This 
number is such that the binary sum of all words in the block should give a 
zero result. The remaining words are the data to be loaded. 
Start 31ock • First word is 000001. 



0,1 



15 



000001 



address 



chc;c!;sum 



The first word contains 1. The second word uses the sign bit as ■? flag. 
If S==0, the loader will transfer to the address in tits 1-15 of the word. 
If S=l , the loader will holt. The third word checksum is the same as that 
for a dat? block. 
Error Block - First word > 1. 



1 



n^rbage 



The first word is greater thnn +1. 

An error hlock is ignored in its entirety by the loader. 

are terminated by a rubout. 



Al 1 error blocks 



